Machine Learning for Online Algorithm Selection under Censored Feedback

نویسندگان

چکیده

In online algorithm selection (OAS), instances of an algorithmic problem class are presented to agent one after another, and the has quickly select a presumably best from fixed set candidate algorithms. For decision problems such as satisfiability (SAT), quality typically refers algorithm's runtime. As latter is known exhibit heavy-tail distribution, normally stopped when exceeding predefined upper time limit. consequence, machine learning methods used optimize strategy in data-driven manner need deal with right-censored samples, that received little attention literature so far. this work, we revisit multi-armed bandit algorithms for OAS discuss their capability dealing problem. Moreover, adapt them towards runtime-oriented losses, allowing partially censored data while keeping space- time-complexity independent horizon. extensive experimental evaluation on adapted version ASlib benchmark, demonstrate theoretically well-founded based Thompson sampling perform specifically strong improve comparison existing methods.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Online Learning under Delayed Feedback

Online learning with delayed feedback has received increasing attention recently due to its several applications in distributed, web-based learning problems. In this paper we provide a systematic study of the topic, and analyze the effect of delay on the regret of online learning algorithms. Somewhat surprisingly, it turns out that delay increases the regret in a multiplicative way in adversari...

متن کامل

Efficient Online Learning under Ban- dit Feedback

In this thesis we address the multi-armed bandit (MAB) problem with stochastic rewards and correlated arms. Particularly, we investigate the case when the expected rewards are a Lipschitz function of the arm and extend these results to bandits with arbitrary structure that is known to the decision maker. In these settings, we derive problem specific regret lower bounds and propose both an asymp...

متن کامل

Reinforcement Learning for Automatic Online Algorithm Selection - an Empirical Study

In this paper a reinforcement learning methodology for automatic online algorithm selection is introduced and empirically tested. It is applicable to automatic algorithm selection methods that predict the performance of each available algorithm and then pick the best one. The experiments confirm the usefulness of the methodology: using online data results in better performance. As in many onlin...

متن کامل

Modern Machine Learning for Automatic Optimization Algorithm Selection

Optimization software is commonly used to solve simulation-based problems such as optimal design and control, model parameter estimation, and best/worst-case scenario identification. While the value of such software is widely recognized, user feedback indicates that these tools are difficult for nonexperts to use. In particular, users are unfamiliar with the details of the plethora of available...

متن کامل

Machine Learning for Subproblem Selection

Subproblem generation, solution, and recombination is a standard approach to combinatorial optimization problems. In many settings identifying suitable subproblems is itself a significant component of the technique. Such subproblems are often identified using a heuristic rule. Here we show how to use machine learning to make this identification. In particular we use a learned objective function...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2022

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v36i9.21279